Interactive User-Feedback for Sound Source Separation
نویسنده
چکیده
Copyright is held by the author/owner(s). IUI’13, March 19–22, 2012, Santa Monica, California, USA. This work was performed while interning at Adobe Research. Abstract Machine learning techniques used for single-channel sound source separation currently offer no mechanism for user-feedback to improve upon poor results and typically require isolated training data to perform separation. To overcome these issues, we present work that applies interactive machine learning principles to incorporate continual user-feedback into the source separation process. In particular, we allow end-users to annotate errors found in source separation estimates by painting on time-frequency displays of sound. We then employ a posterior regularization technique to make use of the annotations to obtain refined source separation estimates and repeat the process until satisfied. An initial prototype shows that the proposed method can significantly improve separation quality compared to previous work and facilitate separation without isolated training data.
منابع مشابه
An Efficient Posterior Regularized Latent Variable Model for Interactive Sound Source Separation
In applications such as audio denoising, music transcription, music remixing, and audiobased forensics, it is desirable to decompose a single-channel recording into its respective sources. One of the current most effective class of methods to do so is based on nonnegative matrix factorization and related latent variable models. Such techniques, however, typically perform poorly when no isolated...
متن کاملSource Separation of Polyphonic Music with Interactive User-Feedback on a Piano Roll Display
The task of separating a single recording of a polyphonic instrument (e.g. piano, guitar, etc.) into distinctive pitch tracks is challenging. One promising class of methods to accomplish this task is based on non-negative matrix factorization (NMF). Such methods, however, are still far from perfect. Distinct pitches from a single instrument have similar timbre, similar note attacks, and contain...
متن کاملResponses in light, sound and scent: a therapeutic interactive yoga system
We describe an interactive system that uses gesture recognition to enhance the yoga experience through visual, auditory and olfactory feedback. Ancient theories associated with Kundalini yoga provide the theoretical basis for this research. The sensory feedback provided by the Therapeutic Interactive Yoga System promotes an immersive, multisensory experience that corresponds to the system of se...
متن کاملA Case Study on Adaptability Problems of the Separation of User Interface and Application Semantics
A large number of software architectures for interactive have been described in literature, like the Seeheim, PAC-Amodeus, and Model-View-Controller architectures. Most of these architectures are based on the traditional view of interactive software, namely the view that an interactive software system can be separated in an application part and a user interface part. The application part contai...
متن کاملAudiovisual installation: interactive imprinting of sound in water
The present project is an interactive audiovisual creation, a sound and vision installation where the user interactively activates sound in its analog form. The sound is used to influence a physical medium (in this case a tank full of water) producing an image through the turbulences on its surface. This image, properly illuminated, is reflected and visualised in space. The installation is desi...
متن کامل